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Isolating and characterizing CLONES 



Tm 13.17 
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Used for RI ASSAY 
Development 

Detailing 

*Assay specificity 

*Assay sensitivity 

^Numerical quantitation 
-ice grain size 
-dilution profiles 
-regression analysis 
-RI Factor estimates 
-Slope comparisons 

*Reference comparisons 
to Tm 12.86 profiles 

^Mathematical modeling 

^Automation 
-multiple samples 
-image analyses 

*Light Scattering method 
for assessing RI 

^Applications 
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3. Excise pBK-CMV phagemid 
containing cloned DNA insert 
by co-infection with helper phage 



bla' fi (-) origin (n+T) 



SV40 polyA 




Ssp I 4410 



MstU 4388 
Ate/14303 
/Sis/ I 4231 



A/ael 134 



Ssp I 445 
Af/ul463 



Nae\ 3369 



Nae I 3086 



Afsfll2756 




SV40 poly(A) 
SV40 3 1 splice 



T7 

5' splice 
Kpn \ 1019 

Sac\ 1126 
T3 



Affiel 1300 
A/s/l 1900 



eit fceverte Crt»er 
5 AtAGGAAAC*GCTAT6ACCfTG 3 



T3 PH»*r 
5' MTTA4CCCTCACTAAA606 3* 



MET 



13 promoter - 



Pst i 

Sac I fis*l II £?M I Ml 



1200 

I _ 

5* TC*t*tA<^AAtAfiCT4TSACCrTa*TTAC6C£AA^ 
3* ASTGTCTCtm&rtSATACTSSMtTA^^ 

1183 



«S#C6 I 8$tX I 

mm ill "*$& I &9 l Hot l Apa I CU I Sw 1 ifert I 

MTTCAMM^TTCtC$ASASTACrKfASA$CGOt^^ 3 
6TmrC6AA£A SC TCTC AISAAC^TCTC&CC&^SCCCCSCTASC T A AAA^T&OGCCCACCCC A f GGTCC A TTCACATGG6T f aA^C&GGATATCACTCAGCaTAATGTT AAGTGACCGGC AGC AAAATST S 

4i- +1 T7 prmottr * 

3' C665AIATCACTCAGCATAATG S" 3" TGACCGStAGCAAAATG 5' 
T7 Primer HU-ZQ Primer 



DNA sequence of Tm 13.17 cDNA clone 
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A. Mature Tm 13.17 amino acid r sidure 



1 LTEAQIEKLN KISKKCQNES GVSQEIITKA RNGDWEDDPK LKRQVFCVAR 
51 NAGLATESGE WVDVLREKV RKVTDNDEET EKIINKCAVK RDTVEETVFN 



101 TFKCVMKNKP KFSPVD 



B. Summary of the composition analysis for the mature Tm 13.17 
O sequence: 
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S J Residue Number Mole Percent 
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= Ala. 
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5.172 
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0 . 000 
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= Cys 
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3.448 
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= Asp 
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6.897 
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= Glu 
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11.207 
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Molecular weight = 13171 .96; Residues = 116; Average Residue Weight = 1 13.551 
Charge = 1 ; Isoelectric point = 7.74. 
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Tm 13/17 3 EAQIEKLNKISKKCQNESGVSQEIITKARNGDWEDDPKLKRQVFCVARNA 52 

I-- III-- I- M-MIMM :.|.|| : llllll :-lh =-l 
AFP- 3 1 ETPREKLKQHSDACKAESGVSEESLNKVRNREEVDDPKLKEHAFCILKRA 50 

Tm 13.17 53 GI^TESGEVVVDVLREKVRKVTDNDEETEKIINKCAVKRDTVEETVFNTF 102 

|: -.Ml. :| ::.|.:. | | j | || : || .: : | 

AFP- 3 51 GFIBASGEFQLDHIKTKFKENSEH'PEKVDDLVAKCAVKKDTPQHSSADFF 100 

Tm 13.17 103 KCVMKNKP 110 

IN • I : - 
AFP- 3 101 KCVHDNRS 108 



Percent identity: 39.8 (identical amino acids; Percent similarity: 58.3 (identical amino 
acids plus conservative amino acids). 
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2-2 



1 GGCACGAGCAAAA 1AT G| A A ACTCCTCTTGTGCTTTGCGT TCGCCGCC 

M K L L L C F A F A A 



47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA 
I V I G A Q A AL T D E Q I Q K 



war 



fj I 



few 



92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC 
R N K I S KECQQVS GVS 

137 CAAGAGACGATCGACAAAGTC C6CACA GGTGTCTTGGTCGATGAT 
QETI DKVRTGVLVDD 

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKMKK HVLCF SKKTG 

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEA G DTNV EVLKA 

271 AAGCTGAAGCAT GTGGCCAGCGACGAAGA G G TGGACAAGATCGTG 
KLKHVASOEEV D K I V 

316 CAGA AGT GCGT G GT C A A G A A G G C C A C A C C A G A G G A A A C G G C T T A T 
QKCVV KKATP EETAY 

361 GACACCT TCAAGTGTATTTAC G A CAGCAAACCTGATTTCTCTCCT 
D TFK CI YDSKPDFS P 

406 ATTGATTAATTGTTTTGTATTTGACTGAATTTT GA CAAT AAA GGT 
ID* 

polyadenylation signal 

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA 
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1 GGCACGAGCAAAA kT Gl A AACTCCTCTTGTGCTTTGCTTTCGCCGCC 

MKLLLCFAFAA 

47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA 
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92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC 
R N K I S K E C QQV S GV S 

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT 
QETI DKVRTGVL VDD 

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKMKKHVLCFSKKTG 

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEAGDTNVEVLKA 

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAAGT GGACAAGAT CGTG 
KL KHVAS DEEVD K! V 

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKCVVKKATPEETAY 

361 G A CACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT 
DTFKCI YDSKPDFSP 

406 ATTGATTAATTGTTTTGTATTTGACTGAATTTT GAC A AT A A A GGT 
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polyadenylation signal 

451 ACTATCGTTATGAAAAAAAAAAAAAAAAAA 



poly (A) tail 



ft 



2-2 
2-3 



G 
G 



C A 
C A 



Start 
4 

AAAATGAAACTCCTCTTGTGCTTTG cIg] 
AAAATGAAACTCCTCTTGTGCTTTG CT 



2-2 TTCGCCGCCATCGTCATCGGAGGTCAGGCTCTCACCG 
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C i 2-2 A C GA A C A GA T ACAGAAAAGGAACAAGATCAGCAAAGA 

5 2-3 A C G A A C A G A T ACAGAAAAGGAACAAGATCAGCAAAGA 

S] 2-2 A T G C C A G C A G GTGTCCGGAGTGTCCCAAGAGACGATC 

m 2-3 A TGCCAGC AG GTGTCCGGAGTGTCCCAAGAGACGATC 

SI 

« 2-2 GA CA A A G T C C GCACAGGTGTCTTGGTCG A[f]G A T C C C A 

01 2-3 GACAAAGT CCGCACAGGTGTCTTGGTCG a|cJg A T C C C A 

□ 2-2 A A A T GA A GA A GCACGTCCTCTGCTTCTCGAAGAAAAC 
Pi 2-3 A A A T GA A GA A GCACGTCCTCTGCTTCTCGAAGAAAAC 

FJ 2-2 T GGAGT GGCAACCGAAGCCGGAGACACCAATGTGGAG 

□ 2-3 T GGAGT GGCAACCGAAGCCGGAGACACCAATGTGGAG 

rl 1 

2-2 GT ACT CA A A GCCAAGCTG. AAGCATGTGGCCAGCGACG 

2-3 GT ACT C A A A GCCAAGCTGAAGCATGTGGCCAGCGACG 
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GT GGA CAAGATCGTGCAGAAGTGCGTGGTCAA 
GT GGA CAAGATCGTGCAGAAGTGCGTGGTCAA 



2-2 GA A GGC C A C A CCAGAGGAAACGGCTTATGACACCTTC 
2-3 G A A G G C C A C A CCAGAGGAAACGGCTTATGACACCTT C 



2-2 AAGT GT AT T T ACGACA G C A AACCTGATTTCTCTCCTA 
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Whole Protein 


Molecular Weight 


12843.80 m.w. 


Length 


115 


1 microgram = 


77.859 pMoles 


Molar Extinction coefficient 


3040*5% 


1 A(280) = 


4.22 mg/ml 


Isoelectric Point 


7.11 


Charge at pH 7 


0.13 
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41 74 
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17.39 
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20 
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17.39 


Polar (NCQSTY) 


30 


25.35 


26.09 


Hydrophobic (AILFWV) 


34 
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29.57 


A Ala 
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3.32 
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3.48 


D Asp 
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G Gly 
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M Met 
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N Asn 
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1.78 


1.74 


P Pro 
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3.48 


Q Gin 
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5.98 


5.22 


R Arg 
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2.43 


1.74 


S Ser 
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4.75 


6.09 


TThr 
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7.08 


7.83 


VVal 
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10.80 


12.17 


WTrp 
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0.00 


0.00 


YTyr 
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2.54 
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B Asx 
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Tm 13.17 cDNA 



1 AOTGGATCCAAAGAATTCGCX2ACGAGACT 

M KLLCCL IS 

61 CCCTCATTC^TTGGTCACAGT^CAGG^ 

IiILIiVTVQA AL TEAQI EK LNK 

~~~ T Forward Primer 

121 AX3ATCAGCAAAAAATGTOVAAATGAAA 

■ISKKCQHESOVSQ-B I I * K A R 

181 GGAACGGTGACTGGGAGGACGATCCTAAACT 

NGDWEDDPKLKRQVFCVA RH 

241 ACG<!£!GGTCTGGCCACGGAATCG 

A G L A T E SGEVVVDVLREKVR 

301 GGAAGGTCACTGAC^CGACGAAGAAACTGAGAAAATCA^ 

KVTDN DEETEK I I NKCAVKR 
Reverse Primer 

3 gX ra ^ATAgTG^GAAGA Gftfc GGTGTO 

D T V E E TVFNTFKCVMKNKPK 

42 1 AGTTCTCACCAGTTGATTGAACCACCACGA 
F S P V D * 



481 ATATAAAA A TA A AGTGTTTCTGATGTAA AA A A A A A AA A AA AAA AAAAAAAA ACTC6 

polyadenylation signal poly <A) tail (26) 
53? AGAGTATTCTAGAGCGGCCGCGGGCCCATCGTTTTCCACCC 



Forward Primer 



2-2 

Tm 13.17 
B2 

AFP-3 



LTDEQ! QKR 
LTEAQI EKL 
LTEEDLQLL 
E T P R E K L 



NKI SKECOOVS 
NKI SKKCONES 
RQTSAECKTES 
KQHSDACKAES 



G V S Q E 

G V S Q B 

G A S E A 

G V S E E 



t DKVRTGVLV 
I T K A R N GO WE 
I KKARKGDLE 
LNKVRNREEV 



2-2 

Tm 13.17 
B2 

AFP-3 



2-2 

Tm 13.17 
B2 

AFP-3 



D D 
D D 

DOPK 
DOPK 



PKMKKHVLCFSKKTGVATEAGDT NVEVL KAKLKH 
PKL KROVFCVARNAGLAT ESGE VVVDVL REKVRK 
KMQLLCI FKALEI VAESGEI EADTFKEKLTR 
KEHAFCI LKRAGFI DASGEFOLDHI KTKFKE 



VAS DEEVDKI 
VTDNDEETEKl 
VTNDDEESEKI 
NSEHPEKVDDL 



Reverse Primer 

VQKCVVKK 
I N K C A V K R 
VEKCTVTE 
VAKCAVKK 




AYDTFKCI YDS 
VFNTFKCVMKN 
AFEVTKCVLKD 
SADFFKCVHON 



.2-2 

Tm 13.17 
B2 

AFP-3 



K P D F S P I D 
KPKFSPVO 
KPNFFGDL FV 
R S 



Primer 



percent % composition 



MeltingTemperature (°C ) 



Forward 
Reverse 



23.6 14.3 42.9 14.3 44.0 
25.0 31.3 6.3 37.5 44.0 




3600 bp 



d^. 4.7 




(Mg. 4.8 



3-4 



%x4 



1 GGCACGAGCAA AA{ATjgA A ACTCCTCTTGTGCTTTGCTTTCGCCGCC 

MKLLLCFAFAA 

47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA 
t V i G A Q A T D E G I Q K 

92 A G G A A C A AGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC 
RN K1 SKECQQVSGVS 

137 CAAGAGACGATCGACAAAGT CCGCACAGGT GTCTT GGT CGACGAT 
QETI DKVRTGVLVDD 

182 CCCAAAATGAAQAAQCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKM-KKHVLCFSKKTG 

226 GTGGCAACCGAAGCCGGAGACACCAATGT GGAGGT ACTCAAAGCC 
VATEAGDTNVEVLKA 

271 AAGCTGAAGCAT GTGGC C A GCGACGAAGAGGTGGACAAGATCGTG 
KtK HVAS D EEVOKI V 

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKCVV KKATPEETAY 

361 GACACCTTCAAGGTTATTTACGACAGTAAACCTGATT'TCTCTCCT 
DTFKVI YDSKPOFSP 

406 ATTGATTAATTGTTTTGTATTTGACTGAATTT TGAC A AT A A A G G T 
I D * 

polyadenylation signal 

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA 



poly (A) tail 



Molecular Weight 


12839.70 m.w. 


Length 


115 


1 microgram = 


77.883 pMoles 


Molar Extinction coefficient 


2920±5% 


1 A(280) = 


4.40 mg/ml 


Isoelectric Point 


7.14 


Charge at pH 7 


0.16 





Number 


%by 


%by 


Amino Acid(s) 


count 


weight 


frequency 


Charged (RKHYCDE) 


47 


46.41 


40.87 


Acidic (DE) 


20 


18.91 


17.39 


Basic (KR) 


20 


20.41 


17.39 


Polar (NCQSTY) 


29 


24.55 


25.22 


Hydrophobic (AILFWV) 


35 


28.04 


30.43 


A Ala 


6 


3.32 


5.22 


CCys 


3 


2.41 


2.61 


D Asp 


11 


9.86 


9.57 


EGlu 


9 


9.05 


7.83 


FPhe 


3 


3.44 


2.61 


G Gly 


4 


1.78 


3.48 


H His 


2 


2.14 


1.74 


I He 


6 


5.29 


5.22 


K Lys 


18 


17.97 


15.65 


LLeu 


5 


4.41 


4.35 


M Met 


1 


1.02 


0.87 


N Asn 


2 


1.78 


1.74 


PPro 


4 


3.02 


3.48 


Q Gin 


6 


5.99 


5.22 


R Arg 


2 


2.43 


1.74 


S Ser 


7 


4.75 


6.09 


TThr 


9 


7.09 


7.83 


VVal 


15 


11.58 


13.04 


WTrp 


0 


0.00 


0.00 


YTyr 


2 


2.54 


1.74 


B Asx 


0 


0.00 


0.00 


ZGIx 


0 


0.00 


0.00 


X Xxx 


0 


0.00 


0.00 


. Ter 


0 


0.00 


0.00 



Predicted Amino Acid 



Composition of 3-4 



3-9 



1 GGCACGAGCAAA A lA T Gl A AACTCCTCTTGTGCTTTGCTTTCGCCGCC 

MKLLLCFAFAA 

47 ATCGTCATCGGAGCTCAGGCTCTCACCGATGAACAGATACAGAAA 
I VI G A Q A T D E Q I Q K 

92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGAGTCCGGAGTGTCC 
RNKI SKECQQESGVS 

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT 
QETI DKVRTGVLVDD 

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAGAACTGGA 
PKMKKHVLCFSKRTG 

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEAGDTNVEVLKA 

271 AAGCT GAAGCAT GT GGCCAGCGACGAAGAAGT GGACAAGAT CGT G 
KLKHVASDEEVDKI V 

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKCVVKKATPE ETAY 

361 GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT 
DTFKVt YDSKPDFSP 

406 ATTGATTAATTGTTTTGTATTTGACTGAATTTTGAC A AT A A A G G T 
I D * 

polyadenylation signal 

451 ACTATCGTTATGAAAAAAAAAAAAAAAAAA 



poly (A) tail 



d$ia. 4. Ha, 



Predicted Amino Acid 



Composition of 3-9 



Analysis 


Whole Protein 


Molecular Weight 


12871.80 m.w. 


Length 


115 


1 microgram - 


77.689 pMoles 


Molar Extinction coefficient 


3040±5% 


1 A(280) = 


4.23 mg/ml 


Isoelectric Point 


7.11 


Charge at pH 7 


0.13 



Whole Protein Composition Analysis 





Number 


%by 


%by 


Amino Acid(s) 


count 


weight 


frequency 


Charged (RKHYCDE) 


48 


47.31 


41.74 


Acidic (DE) 


20 


18.86 


17.39 


Basic (KR) 


20 


20.57 


17.39 


Polar (NCQSTY) 


30 


25.29 


26.09 


Hydrophobic (AILFWV) 


34 


27.20 


29.57 


A Ala 


6 


3.31 


5,22 


CCys 


4 


3.21 


3.48 


DAsp 


11 


9.84 


9.57 


EGlu 


9 


9.03 


7.83 


FPhe 


3 


3.43 


2.61 


GGly 


4 


1.77 


3.48 


H His 


2 


2.13 


1.74 


I He 


6 


5.28 


5.22 


K Lys 


17 


16.93 


14.78 


LLeu 


5 


4.40 


4.35 


M Met 


1 


1.02 


0.87 


N Asn 


2 


1.77 


1.74 


P Pro 


4 


3.02 


3.48 


G Gin 


6 


5.97 


5.22 


R Arg 


3 


3.64 


2.61 


S Ser 


7 


4.74 


6.09 


TThr 


9 


7.07 


7.83 


VVal 


14 


10.78 


12.17 


WTrp 


0 


0.00 


0.00 


YTyr 


2 


2.54 


1.74 


B Asx 


0 


0.00 


0.00 


ZGIx 


0 


0.00 


0.00 


X Xxx 


0 


0.00 


0.00 


-Ter 


0 


0.00 


0.00 



7-5 



1 GGCACGAGCAAA A jAT Gl A AACTCCTCTTGT GCTTTGCGTTCGCCGCC 

MKLLLCFAFAA 

47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA 
I VI G A Q A T D E Q I Q K 

92 AGGAACAAGATCAGCAAAGAGTGCCAGCAGGTGTCCGGAGTGTCC 
RNKI SKECQQESGVS 

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT 
QET I DKVRTGVL VDD 

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKMKKHVLCFSKRTG 

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEAGDTNVEVLKA 

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAAGTGGACAAGATCGTG 
KLKHVASDEEVDKI V 

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKCVVKKATPEETAY 

361 GACACCTTCAAGT GTATTTACGACAGTAAACCT GATTTCTCTCCT 
DTFKVI YDSKPDFSP 

406 ATTGATTAATTGTTTTGTATTTGGCTGAATTTT GAC AAT AAA GGT 
I D * 

polyadenylation signal 

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA 



poly (A) tail 



Predicted Amino Acid 



Composition of 7—5 



Analysis 


Whole Protein 


Molecular Weight 


12843-80 m.w. 


Length 


115 


1 microgram = 


77.859 pMoles 


Molar Extinction coefficient 


3040±5% 


1 A(280) = 


4.22 mg/ml 


Isoelectric Point 


7.11 


Charge at pH 7 


0.13 



Whole Protein Composition Analysis 





Number 


%by 


%by 


Amino Actd(s) 


count 


weight 


frequency 


Charged (RKHYCDE) 


48 


47.19 


41.74 


Acidic (DE) 


20 


18.90 


17.39 


Basic (KR) 


20 


20.40 


17.39 


Polar (NCQSTY) 


30 


25.35 


26.09 


Hydrophobic (AILFWV) 


34 


27.26 


29.57 


A Ala 


6 


3.32 


5.22 


CCys 


4 


3.21 


3.48 


D Asp 


11 


9.86 


9.57 


EGlu 


9 


9.05 


7.83 


FPhe 


3 


3.44 


2.61 


GGly 


4 


1.78 


3.48 


H His 


2 


2.14 


1.74 


I lie 


6 


5.29 


5.22 


KLys 


18 


17.97 


15.65 


LLeu 


5 


4.41 


4.35 


M Met 


1 


1.02 


0.87 


N Asn 


2 


1.78 


1.74 


P Pro 


4 


3.02 


3.48 


Q Gin 


6 


5.98 


5.22 


R Arg 


2 


2.43 


1.74 


SSer 


7 


4.75 


6.09 


TThr 


9 


7.08 


7.83 


VVal 


14 


10.80 


12.17 


WTrp 


0 


0.00 


0.00 


YTyr 


2 


2.54 


1.74 


B Asx 


0 


0.00 


0.00 


ZGIx 


0 


0.00 


0.00 


XXxx 


0 


0.00 


0.00 


.Ter 


0 


0.00 


0.00 
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NUCLEOTIDE SEQUENCES 
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PERCENT SIMILARITY 



3 



4 



5 6 7 8 



9 TO 



5* 



'1 




99.1 


97.3 


98.4 


98.2 


5Q.4 


42 


42.4 


2Q.7 


2Q..9 


1 


2-2 
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0.9 




99 


99 


99 


51.5 


43.2 


42.7 


20.2 


21.3 
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2-3 
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1.6 


60 




98 
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5Q.3 


42 


42.1 


20 


20 
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3-4 
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1.6 


20 


80 




98.4 


51.3 


43.7 


43.6 


20.4 


22 
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3-9 
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25 


75 


45 




50.4 


42.'5 


42.9 


22.2 


23.1 
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7-5 
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36.9 


36.4 


37.2 


36.4 


37.1 




57.2 


37.4 


23.3 


21 


6 


Tm 13.17 
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39.9 


41.2 


41.7 


40.7 


41.2 


30.2 




39.3 


21.9 


22.1 


7 


B1 
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41.8 


42.4 


43.1 


42.3 


4Z.3 


45.3 


49T4" 




21.8 


23.1 
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AFP-3 
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48.7 


58.6 


52 


51.2 


51.9 


61.7 


62.2 


58^2" 




45.6 
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YL-1 


10 


51.4 


62.7 


46.4 


46.5 


46.5 


60 


58.1 


61.8 


35^ 
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AMINO ACID SEQUENCES 
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11.6 
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37.4 


36.1 
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0.09 
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37.4 


35.2 


11.6 


12 
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7-5 
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46.1 


46.1 


47 


46.1 


46.1 




47.4 


39.8 


13.4 


13.9 
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Tm 13.17 
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59.1 


59.1 


60 


59.1 


59.1 


51.7 




37 


11.6 


11.1 
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B1 
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61.7 


61.7 
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60.7 


61.7 


60.2 


63 




10.2 
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AFP-3 
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86.7 


86.7 


85.7 


85.7 


85.7 


86.8 


84.2 


87?5" 




55.6 
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YL-1 
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88.3 


88.3 


86.4 


84.5 


85.4 


89.1 


82.7 


90-5 
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1 GGCACGAGCAAA A lAT GI A A ACTCCTCTTGTGCTTT G lC GTTCGCCGCd 

MKLLLCFA FAA 

P f G 

47 AT CGT C AT ql G GAGCTCAGGC T IC TCACCGACGAACA Gf A TACAGAAA 
J V 1 G A Q A At T D E Q I Q K 

92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTC C b GAGTGTCC 
RNKI SK ECQQVSGV S 



137 CAAGAjGACGATCGACA A A GT C C G C A C A G GT GTCTT GGTCGAT GAT 
Q E TI DKVRTGVLVDD 

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKMKKHVLCF SKKTG 

226 GT GGCAACCGAAGCCGGAGACACCAATGTGGAG6T ACTCAAAGCC 
VATEAGDTNVEVLKA 

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAGGTGGACAAGATCGTG 
KLKHVASDEEVDKI V 

V 

316 CAGAAGTGCGTGGTCAAGAAGGCC tACACCAGAGGAAACGGCTTAT 
QKCVVKKA T P E E T A Y 

V Gr 

361 GACACC lTTCAAGTGTATTTACGACAGCAAACCTGATTT lCTCTCCT 
D TFKCI YDSKPDF ^ P ~ 

Gr / 



406 ATTGATTAAl TTGTTTTGT ATTTGACTGAATTTTGA CAATAA A GGT 
I D . 

polyadenylation signal 

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA 



poly (A) tail 



His -tagged Clone 2.2 with signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGG6G ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 
Met 61 y Ser Ser His His His His His His Ser 
-55 -50 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-45 -40 -35 

AFP Start Codon 

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC GCA CGA GCA AAA ATG 186 
Gly Gin Gin Met Gly Arg Gly Ser Glu Phe Ala Arg Ala Lys Met 
-30 -25 -20 

AAA CTC CTC TTG TGC TTT GCG TTC GCC GCC ATC GTC ATC GGA GCT 231 
Lvs Leu Leu Leu Cys Phe Ala Phe Ala Ala He Val He Gly Ala 

-15 -10 -5 

N-terminal of mature AFP 
CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA AGG AAC AAG ATC AGC 276 
Gin Ala Leu Thr Asp Glu Gin He Gin Lys Arg Asn Lys He Ser 

15 10 

AAA GAA TGC CAG CAG GTG TCC GGA GTG TCC CAA GAG ACG ATC GAC 321 
Lys Glu Cys Gin Gin Val Ser Gly Val Ser Gin Glu Thr He Asp 
15 20 25 

AAA GTC CGC ACA GGT GTC TTG GTC GAT GAT CCC AAA ATG AAG AAG 366 
Lys Val Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys 
30 35 40 

CAC GTC CTC TGC TTC TCG AAG AAA ACT GGA GTG GCA ACC GAA GCC 411 
His Val Leu Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala 
45 50 55 

GGA GAC ACC AAT GTG GAG GTA CTC AAA GCC AAG CTG AAG CAT GTG 456 
Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu Lys His Val 
60 65 70 

GCC AGC GAC GAA GAG GTG GAC AAG ATC GTG CAG AAG TGC GTG GTC 501 
Ala Ser Asp Glu Glu Val Asp Lys He Val Gin Lys Cys Val Val 
75 80 85 

AAG AAG GCC ACA CCA GAG GAA ACG GCT TAT GAC ACC TTC AAG TGT 546 
Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys Cys 
90 95 100 

Stop Codon 

ATT TAC GAC AGT AAA CCT GAT TTC TCT CCT ATT GAT TAA TTGTTTTGTA 595 
He Tyr Asp Ser Lys Pro Asp Phe Ser Pro He Asp * 
105 110 115 

Polyadenylation signal Poly-A tail 
TTTGACTGAA TTTTGAC AAT AAA GGTAATA TCGTTATGTA AAAAAAAAAA 645 

AAAAAACTCG AGCACCACCA CCACCACCAC TGAGAT 681 



d^. 57 



His-tagged clone 2.2 without signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His -tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 
Met Gly Ser Ser His His His His His His Ser 
-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N-terminal of mature AFP 
GGA CAG CAA ATG GGT CGC GGA TCC CTC ACC GAC GAA CAG ATA CAG 186 
Gly Gin Gin Met Gly Arg Gly Ser Leu Thr Asp Glu Gin He Gin 
-5 15 

AAA AGG AAC AAG ATC AGC AAA GAA TGC CAG CAG GTG TCC GGA GTG 231 
Lys Arg Asn Lys He Ser Lys Glu Cys Gin Gin Val Ser Gly Val 
10 15 20 

TCC CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAT 276 
Ser Gin Glu Thr He Asp Lys Val Arg Thr Gly Val Leu Val Asp 
25 30 35 

GAT CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT 321 
Asp Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr 
40 45 50 

GGA GTG GCA ACC GAA GCC GGA GAC ACC AAT GTG GAG GTA CTC AAA 366 
Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys 
55 60 65 

GCC AAG CTG AAG CAT GTG GCC AGC GAC GAA GAG GTG GAC AAG ATC 411 
Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys lie 
70 75 80 

GTG CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT 456 
Val Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala 
85 90 95 

TAT GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT 501 
Tyr Asp Thr Phe Lys Cys He Tyr Asp Ser Lys Pro Asp Phe Ser 
100 105 110 

Stop Codon 

CCT ATT GAT TAA CTCGAGCACC ACCACCACCA CCACTGAGAT 543 
Pro He Asp * 
115 



d^. 5.8 



His-tagged clone 2.3 with signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 

AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 

Met Gly Ser Ser His His His His His His Ser 
-55 -50 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-45 -40 -35 

AFP Start Codon 

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC GCA CGA GCA AAA ATG 186 
Gly Gin Gin Met Gly Arg Gly Ser Glu Phe Ala Arg Ala Lys Met 
-30 -25 -20 

AAA CTC CTC TTG TGC TTT GCT TTC GCC GCC ATC GTC ATC GGA GCT 231 
Lys Leu Leu Leu Cys Phe Ala Phe Ala Ala He Val He Gly Ala 
-15 -10 -5 

N-terminal of Mature AFP 
CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA AGG AAC AAG ATC AGC 276 
Gin Ala Leu Thr Asp Glu Gin He Gin Lys Arg Asn Lys He Ser 

1 5 10 

AAA GAA TGC CAG CAG GTG TCC GGA GTG TCC CAA GAG ACG ATC GAC 321 
Lys Glu Cys Gin Gin Val Ser Gly Val Ser Gin Glu Thr He Asp 
15 20 25 

AAA GTC CGC ACA GGT GTC TTG GTC GAT GAT CCC AAA ATG AAG AAG 366 
Lys Val Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys 
30 35 40 

CAC GTC CTC TGC TTC TCG AAG AAA ACT GGA GTG GCA ACC GAA GCC 411 
His Val Leu Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala 
45 50 55 

GGA GAC ACC AAT GTG GAG GTA CTC AAA GCC AAG CTG AAG CAT GTG 456 
Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu Lys His Val 
60 65 70 

GCC AGC GAC GAA GAA GTG GAC AAG ATC GTG CAG AAG TGC GTG GTC 501 
Ala Ser Asp Glu Glu Val Asp Lys He Val Gin Lys Cys Val Val 
75 80 85 

AAG AAG GCC ACA CCA GAG GAA ACG GCT TAT GAC ACC TTC AAG TGT 546 
Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys Cys 
90 95 100 

Stop Codon 

ATT TAC GAC AGT AAA CCT GAT TTC TCT CCT ATT GAT TAA TTGTTTTGTA 595 
He Tyr Asp Ser Lys Pro Asp Phe Ser Pro He Asp * 
105 110 115 

Polyadenylation signal Poly-A tail 
TTTGACTGAA TTTTGACAAT AAA GGTACTA TCGTTATGAA AAAAAAAAAA 645 

AAAAAAACTC GAGGACCACC ACCACCACCA CTGAGAT 682 



d^. 5.3 



His-tagged Clone 2.3 without signal sequence 



01 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His -tag Start Codon 

AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 

Met Gly Ser Ser His His His His His His Ser 

-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N-terminal of mature AFP 
GGA CAG CAA ATG GGT CGC GGA TCC CTC ACC GAC GAA CAG ATA CAG 186 
Gly Gin Gin Met Gly Arg Gly Ser Leu Thr Asp Glu Gin He Gin 
-5 15 

AAA AGG AAC AAG ATC AGC AAA GAA TGC CAG CAG GTG TCC GGA GTG 231 
Lys Arg Asn Lys He Ser Lys Glu Cys Gin Gin Val Ser Gly Val 
10 15 20 

TCC CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAT 276 
Ser Gin Glu Thr He Asp Lys Val Arg Thr Gly Val Leu Val Asp 
25 30 35 

GAT CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT 321 
Asp Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr 
40 45 50 

GGA GTG GCA ACC GAA GCC GGA GAC ACC AAT GTG GAG GTA CTC AAA 366 
Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys 
55 60 65 

GCC AAG CTG AAG CAT GTG GCC AGC GAC GAA GAA GTG GAC AAG ATC 411 
Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys He 
70 75 80 

GTG CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT 456 
Val Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala 
85 90 95 

TAT GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT 501 
Tyr Asp Thr Phe Lys Cys He Tyr Asp Ser Lys Pro Asp Phe Ser 
100 105 110 

Stop Codon 

CCT ATT GAT TAA CTCGAGCACC ACCACCACCA CCACTGAGAT 543 
Pro lie Asp * 
115 



(My,. 5 JO 



His-tagged Tm 13.17 with signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 
Met Gly Ser Ser His His His His His His Ser 
-65 -60 -55 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-50 -45 -40 

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC TGG ATC CAA AGA ATT 186 
Gly Gin Gin Met Gly Arg Gly Ser Glu Phe Trp He Gin Arg lie 
-35 -30 -25 

AFP Start Codon 

CGG CAC GAG ACT ACT AAG ATG AAG TTG CTC TGT TGT CTA ATC TCC 231 

Arg His Glu Thr Thr Lys Met Lvs Leu Leu Cys Cys Leu lie Ser 

-20 -15 -10 

N-terminal of mature AFP 
CTC ATT CTG TTG GTC ACA GTT CAG GCC CTG ACC GAG GCA CAA ATT 276 
Leu He Leu Leu Val Thr Val Gin Ala Leu Thr Glu Ala Gin He 
-5 15 

GAG AAA CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA 321 
Glu Lys Leu Asn Lys lie Ser Lys Lys Cys Gin Asn Glu Ser Gly 
10 15 20 

GTG TCG CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG 366 
Val Ser Gin Glu He He Thr Lys Ala Arg Asn Gly Asp Trp Glu 
25 30 35 

GAC GAT CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC 411 
Asp Asp Pro Lys Leu Lys Arg Gin Val Phe Cys Val Ala Arg Asn 
40 45 50 

GCC GGT CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG 456 
Ala Gly Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu 
55 60 65 

AGG GAG AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG 501 
Arg Glu Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu 
70 75 80 

AAA ATC ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG 546 
Lys He He Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu 
85 90 95 

ACG GTG TTC AAT ACT TTC AAA TGT GTC ATG AAA AAC AAG CCA AAG 595 
Thr Val Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys 
100 105 110 

Stop Codon 

TTC TCA CCA GTT GAT TGA ACCACCACGA CTAGTAGATG GTTCAAATGG 643 
Phe Ser Pro Val Asp * 
115 

Polyadenylation signal Poly-A tail 
TGTGCTTTAC ATATAAA AAT AAA GTGTTTC TGATGTAAAA AAAAAAAAAA 693 

AAAAAAAAAA AACTCGAGAG TATTCTAGAG CGGCCGCGGG CCCATCGTTT 743 

TCCACCCCTC GAGCACCACC ACCACCACCA CTGAGAT 777 



His-tagged Tm 13.17 without signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 

AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 

Met Gly Ser Ser His His His His His His Ser 

-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N-terminal of mature AFP 
GGA CAG CAA ATG GGT CGC GGC CTG ACC GAG GCA CAA ATT GAG AAA 186 
Gly Gin Gin Met Gly Arg Gly Leu Thr Glu Ala Gin He Glu Lys 
-5 15 

03 CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA GTG TCG 231 

^ Leu Asn Lys He Ser Lys Lys Cys Gin Asn Glu Ser Gly Val Ser 

0^ 10 15 20 

y3 CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG GAC GAT 276 

yl Gin Glu He He Thr Lys Ala Arg Asn Gly Asp Trp Glu Asp Asp 

25 30 35 

pj CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC GCC GGT 321 

Pro Lys Leu Lys Arg Gin Val Phe Cys Val Ala Arg Asn Ala Gly 
40 45 50 

CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG AGG GAG 366 
Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu Arg Glu 
55 60 65 

AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG AAA ATC 411 
Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu Lys He 
70 75 80 

ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG ACG GTG 456 
He Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu Thr Val 
85 90 95 

TTC AAT ACT TTC AAA TGT GTC ATG AAA AAC AAG CCA AAG TTC TCA 501 
Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys Phe Ser 
100 105 110 

Stop Codon 

CCA GTT GAT TGA CTCGAGCACC ACCACCACCA CCACTGAGAT 543 
Pro Val Asp * 
115 
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